Strong correspondence principle for joint measurement of conjugate observables 
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It is demonstrated that the the statistics for a joint measurement of two conjugate variables in 
Quantum Mechanics are expressed through an equation identical to the classical one, provided that 
joint classical probabilities are substituted by Wigner functions and that the interaction between 
system and detectors is accounted for. This constitutes an extension of Ehrenfest correspondence 
principle, and it is thereby dubbed strong correspondence principle. Furthermore, it is proved that 
the detectors provide an additive term to all the cumulants, and that if they are prepared in a 
Gaussian state they only contribute to the first and second cumulant. 
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Simultaneous knowledge of two conjugate observables 
is a discriminating feature between Classical and Quan- 
tum Mechanics: it is possible, at least in principle, in the 
former, and impossible in the latter. For Quantum Me- 
chanics this should be interpreted as the non-feasibility of 
knowing with arbitrary precision both observables, not as 
the impossibility of realizing joint measurements. Indeed, 
in a pioneering work, Arthurs and Kelly [l| demonstrated 
that a scheme exists for joint measurement of position 
and momentum of a particle, albeit at the cost of sac- 
rificing the precision of both measurements. In [l[ very 
specific hypotheses were made about the initial prepara- 
tion of the detectors. Later on, these assumptions were 
relaxed @, and the connection of the joint probabil- 
ity to the Husimi Q function Q was found. Remark- 
ably, joint measurements of conjugate observables have 
been realized in optics, either through multiport tech- 
niques [lQ] or homodyne detection Furthermore, it 
was demonstrated [B| that in the joint measurement the 
product of the uncertainty of the conjugate variables is 
not less than h, twice as much as the limit of the uncer- 
tainty principle, which applies instead to separate mea- 
surements of position and momentum. This result was 
followed by several papers discussing the interpretation 
of the uncertainty relations starting from the 90s Q up 
to recent years [7j. On another front, the conditional 
state of the system after the measurement was studied 
[H, Q. Ref. @, however, assumed that the statistics is 
fully determined by the observed values and the associ- 
ated spreads, from which a gaussian function was con- 
structed. This in general is not correct, since the statis- 
tics of the outcomes is not always gaussian, and higher 
order cumulants are needed to characterize it. 

In the present work I shall prove that the formula giv- 
ing the probability density for the output of the detectors 
is identical to the one which is obtained in the classi- 
cal case, provided that joint probabilities for the system 
observables and for the detectors observables are substi- 
tuted by the Wigner quasi-probabilities functions. This 
result goes beyond Ehrenfest theorem, which applies to 
average values only. Analogously, I demonstrate that, 
considering the conditional state of the system given the 
outcomes, the formula which expresses such a state in 



terms of Wigner function is identical to the classical equa- 
tion. Also, the more general expression for the condi- 
tional state of the system is derived, and it differs from 
the one surmised in Q. 

I consider detection of two conjugate variables, denoted 
Q, K, with units such that [Q, K] = i. The interaction 
between detectors and system is written 

H in t = S(t - to) X A$AA (1) 

A=Q,K 

with <f>A an operator on the ^4-th detector Hilbert space, 
and A the observables of the system, \a coupling con- 
stants and to time of the measurement. This coupling 
corresponds to the standard von Neumann detection 
scheme Q, duplicated to allow joint measurement of 
non-commuting observables. As such it is known as the 
Arthurs-Kelly scheme, after the authors who introduced 
it for the first time [l|. For a single measurement (e.g. 
A/< = 0), in order to have an ideal measurement, (i) 
initially, the density matrix of detector and system is 
P = Ps <8 PA', (h) the detector must be prepared in a 
sharp state pdetiJ-A, I'a) — \^A = 0)(Ia = 0|, peaked 
around I a = 0. 

Indeed, under assumptions (i) and (ii), after the inter- 
action with the system the probability distribution for 
the variable A is 

U(I A \A) = J dii{a)5{I A - X A a)(a\p s (to)\a), (2) 

a, | a) being eigenvalues and eigenstates of A, and p(a) 
a measure. Eq. ^ implies that in any single measure- 
ment 1a will be found to have one of the values A^et. 
We can relax hypothesis (ii) by requiring, more realisti- 
cally, that (ii') initially the detector is prepared in a state 
Pa{Ia,I'a)i wrtn nfJ^) := pa(Ia,Ia) peaked around 
I A = 0. Then Eq. @ becomes the convolution 



H{I A \A) = J d f i(a)U A {lA-XAa)(a\ps(to)\a). (3) 
In terms of characteristic functions 

Za{xa;^) = z s (xA;t )z A (xA;to)- (4) 
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Actually, one can consider a finite-time measurement, i.e. 
substitute the delta- function with a regular function g(t), 
vanishing outside a finite time window, under the addi- 
tional assumptions: (iii) the Hamiltonian of the detec- 
tor depends only on I a, the variables conjugated to 4>a, 
so that 4>a, Ib J = Sabi^', and (iv) the observable A 
is conserved during the free evolution of the system (at 
least approximately). The measurement is then a non- 
demolition one [12J. Since I am interested in measuring 
non-commuting observables, the only Hamiltonian con- 
serving simultaneously Q and K would be the trivial zero 
Hamiltonian, thus I keep the instantaneous interaction, 
and make use of assumptions (i) and (ii') throughout the 
rest of this paper. (In the case of joint measurement 
of spin components, any spin-independent Hamiltonian 
conserves simultaneously all spin components, and then 
one could consider a finite duration measurement.) 

Now, let us consider a simultaneous interaction with 
both detectors, i.e. ^ 0,A — Q,K. In the follow- 
ing I shall rescale the operators to eliminate the cou- 
pling constants and Planck's constant: Xa^a/^ 4>Ai 
Ia/Xa —> I a- This way Iq,<S>k have the same dimen- 
sions as Q, while Ik,&q have the same dimensions as 
K. The formalism of quantum mechanics allows to ask 
what will be the probability density for Iq,Ik- Indeed, 
applying Born's rule to the time evolution of the total 
density matrix, we have 



where we dropped the functional dependence for brevity 
and defined Va± ■= exp (i($A ± xa/2)a| 

We can use the position eigenstates to express the 
trace, giving 

k({$a}Axa}) = e ^*«-***«)/ 2 ZJ{ X k,Xq\ (10) 

with 



Z^(q,k) :-- 



n({I A }) = Tr s { {I A \e l ^ $* A p~ e~ l ^ ** X 



\Ia)}, (5) 



with p~ := ps<S>pdet, the density matrix of the system and 
detectors immediately before io- After introducing twice 
the identity over the detectors Hilbert space in terms of 
eigenstates of 4>a , we have the probability density 



= / dQe^ Ps (Q + q/2,Q~q/2) 

(11) 

the Fourier transform of Wigner quasi-probability 
U^(K, Q). I changed the sign of the second variable 
so that Zg (0, x) is the generating function for the prob- 
ability lis (If) = (Q\ps\Q), and Zf(x,0) is the gener- 
ating function for the probability Ils(K) = (K\ps\K). 
If the Wigner quasi-probability were positive defined, 
Z^(xk,Xq) would be the corresponding characteristic 
function. Since this is not in general the case, I shall call 
Zg a quasi-characteristic function. 

It follows readily from Eqs. (|7|) and (|10p that the char- 
acteristic function is 

z(xk, xo) = z J{xk, XQ)z™ t (xQ, ^;xk, (1 2 ) 

where 

z YetiXQ, *q; Xk, Ik) ■= KfetiXQ, -*Q5 Xk, -ik) 
= Jd<S> Q d<S> K e^^ AlA ({<S> A + XA/2}\p det \{$>A-XA/2}) 

(13) 

is the quasi-characteristic function of the detectors. 

Eq. (IT21) should be contrasted with the case when first 
a measurement of K is made, and shortly thereafter Q is 
observed: 



n«M> =/ (lit 1 ) «-=.»'•*«*«*,», (0 Z<(XK , XQ) =Z¥( X K,Xe)ZZ t (XcO; XK ,- XQ ), (14) 



where the generating function is 

Zk,q({xa})= J (li^A K({$a},{xa}) 

xp def ({$, + |,$,-|}), (7) 

with the kernel 

K{{* a },{xa})=T±s{v+PsV1}, (8) 

and V± :=ex.p{iJ2 A (®A±XA/2)Ay 

Since the operators Q, K do not commute with each 
other, the kernel in Eq. ([7]) will not be in general a func- 
tion of xa only. Eq. (JSJ) can be rewritten, after applying 
Baker-Campbell-Hausdorff formula, 

K = e i(*KX Q +XK* Q )/i Tr s {v Q+ V K+ p s Vl_V^} , (9) 



or viceversa 



Z>(xk,Xq) =Z s (XK,XQ)ZZt(XQ,XK]XK,0). (15) 

An important conclusion that can be drawn from 
Eq. (fT2"j) is that the contribution of the detectors to all 
the cumulants (defined as logarithmic derivatives of Z 
calculated at xq = ®iXK — 0) is simply an additive 
term. In particular, if the detectors are prepared in a 
gaussian state, so is Zdet, and thus their contribution to 
the cumulants higher than the second one vanish. 

The probability density is obtained by Fourier trans- 
forming Eq. (|12[) . and it consists in a convolution of the 
Wigner quasi-probability densities: 

u(i Q ,i K )= J dr Q dr K d& Q d& K n% t (i' Q ,& Q ;i' K ,& K ) 
x nY(i K -i' K -^,i Q -i' Q +^f). (i6) 
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A priori, it is not obvious that the convolution of two ar- 
bitrary quasi-probability functions presented in Eq. (|16[) 
is positive-defined. However, U(Iq,Ik) is positive by 
construction, thus an interesting mathematical corollary 
follows from the derivation presented above: given any 
two quasi-probability functions, their convolution as de- 
fined in Eq. (|16|) gives a proper probability distribution. 
In particular, one can consider the detectors to be ini- 
tially independent of one another, so that 

U(I Q ,I K )= J dI Q dI' K d^ Q d^ K IL^(I Q ^' Q )U^(I' K ,^ K ) 

x nJ(i K -i' K -^,i Q -i' Q +^f). (i?) 

is positive definite for any three Wigner functions. 

Furthermore, Eq. (|16[) would have a simple interpre- 
tation if the Wigner quasi-probabilities were positive- 
defined: before the interaction, the observables of the de- 
tector A possessed the values I' A and <&' A with probability 
n^ t (/g, $q; I' k , <&' k ), and the Q, K variables of the sys- 
tem had the values Q',K', with probability Iljf (Q', K'). 
After the interaction, the value of Iq is shifted determin- 
istically by Q' - & K /2, and that of I K by K' + $' Q /2. 
It is interesting to note that this is indeed the result one 
would obtain in the classical case, if the interaction term 
is given by Eq. ([T]). After solving the classical Hamil- 
tonian equations, the value of Iq,Ik immediately after 
the interaction are (primed quantities are calculated at 
<q = to — e, and unprimed ones at tg~ = to + e) 



where the indexed brackets indicate averaging over the 
density matrices of system and detectors before the inter- 
action, while the unindexed ones indicate averaging over 
the probability H(Iq,Ik) given in Eq. (Tl6l) . We notice 
that Eqs. (TTgj) are identical in form to Eqs. This 
is a consequence of the Ehrcnfcst theorem, which implies 
that, for quadratic Hamiltonians, the equations of mo- 
tion for the average values of an observable are identical 
with the corresponding classical equations. The formal 
identity of Eq. (|16[) in the classical and quantum case, 
however, is a new result, going well beyond Ehrenfest the- 
orem. I call this result the strong correspondence prin- 
ciple. It amounts to a simple prescription: (i) solve the 
classical equations of motion for the interaction between 
detectors and system; (ii) assume an initial joint prob- 
ability distribution H(Iq, $q; Ik , $k ) for the detectors 
and II(Q, K) for the system; (iii) find the joint probabil- 
ity of observing outcomes Iq , Ik in terms of the initial 
probabilities; (iv) substitute the classical probability dis- 
tributions with the Wigner quasi-probability ones. 

Let us consider the state of the system after the 
detection, conditioned on the fact that the readout 
of the detectors was Iq,Ik- Rather than work- 
ing with the density matrix, I consider the condi- 
tional quasi-characteristic function of the system. The 
unnormalized conditional quasi-charateristic function 
TI(Iq,Ik)ZY{q, k\lQ,lK) can be found from Eqs. © by 
substituting the kernel in (|8]) with 



Iq =Q' 
Ik =K' 



I'k 



Q = Q' - 
K = K' 



from which Eq. (|16p readily follows. 
It follows from Eq. that 



Ki 



(Iq}=(Q)s + (Iq)q-($k)k/2, 
(Ik) =(K)s + (Ik)k + ($q)q/2, 



(18a) 
(18b) 



(19a) 
(19b) 



K= / dQe* k Q(Q + l\V +Ps vl\Q-l). (21) 



This readily gives 



and 



(AI^) =(&Q 2 )s + (&I 2 q)q + - a (^1)k + (AI Q A* K ), 

(20a) 

(All) =(&K 2 ) S + (AP k ) k + \(A& Q ) Q + (AI K A$ Q ), 

(20b) 



Zf{q, k\I Q ,I K ) = J dxQdxK 



cxp [-i(xqIq +XkIk)} 

WqJk) 



x Zf{q + X K,k + XQ)Z% t ( X Q,q + XK, -k - f) 



and the corresponding quasi-probability distribution is 



r 



or It has a simple classical interpretation: according to 

U(I Q ,I K )Ilf(K,Q\I Q ,I K ) = J dQ'dK'Uf(K',Q') 



xUZ t (I Q -^±^,K-K'-,I K -^±^,Q'-Q). (23) 
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Bayes' theorem, the conditional probability of finding the 
system with values Q,K, given that the detectors gave 
the output Iq , Ik satisfies 

U(I q ,I k )II s (Q,K\Iq,I k ) = IL(Q,K,I Q ,I K ) (24) 

The classical joint probability n(<3, K, Iq,Ik) can be de- 
rived from the classical equations of motion Eqs. (TT8)) 
through the following reasoning: for given $q,$j<-, the 
values of the system before the interaction must be 
Q + K - $q; this happens with probability II <f (K — 
®q,Q + &k)- The values of Iq,Ik before the interaction 
must have been I' Q = Iq-Q-$> k /2,I' k = I K -K+$ Q /2, 
and arbitrary <&q,<&k] this happens with probability 
Il det (I Q -Q~<I> K /2,<f> Q ;I K ~K+<S> Q /2,<I> K ). Integrating 
over all possible values $q , $k gives Eq. (|22p . Thus we 
have a further application of the strong correspondence 
principle: one could derive the joint conditional probabil- 
ity through the classical reasoning, and then substitute in 
the formulas the Wigner quasi-probabilities distributions 
of detectors and system for the positive defined classical 
probabilities. 

We notice that, for general preparation of the detec- 
tors, the conditional state of the system depends on its 
initial state, and it is not a gaussian, contrary to what 
was concluded in Q applying the principle of maximum 
entropy. 

In conclusion, I have demonstrated a rich correspon- 
dence between classical and quantum mechanics: not 
only do the average values of an observable obey the clas- 
sical equations of motion, as established by Ehrenfest's 
theorem, but the full joint probability of the outcomes 
has the same expression in the classical as in the quan- 
tum case in terms of the initial probability distributions 
of the dynamical variables of system and detectors, pro- 
vided that one substitutes the classical joint probabilities 
with Wigner quasi-probabilities. Due to the uncertainty 
relations, the Wigner quasi-probabilities come in such 
combinations that they give rise to a positive probabil- 
ity distribution. A fecund concept was also introduced, 
that of quasi-characteristic function, in terms of which 



the characteristic function of the joint outcomes has a 
remarkably simple expression [see Eq. (|12[) ]. From this, 
one can conclude that detectors contribute with an ad- 
ditive term to the cumulants of all orders. The strong 
correspondence between the classical and the quantum 
case was shown to hold also for the determination of the 
conditional state of the system after the measurement. 
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